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ABSTRACT 

The development of multi-layer optics which allow to focus photons up to 100 keV and more promises 
an enormous jump in sensitivity in the hard X-ray energy band. This technology is already planned 
to be exploited by future missions dedicated to spectroscopy and imaging at energies >10 keV, e.g. 
Astro-H and NuSTAR. Nevertheless, our understanding of the hard X-ray sky would greatly benefit 
from carrying out contemporaneous polarimetric measurements, because the study of hard spectral 
tails and of polarized emission often are two complementary diagnostics of the same non-thermal 
and acceleration processes. At energies above a few tens of keV, the preferred technique to detect 
polarization involves the determination of photon directions after a Compton scattering. Many authors 
have asserted that stacked detectors with imaging capabilities can be exploited for this purpose. If 
it is possible to discriminate those events which initially interact in the first detector by Compton 
scattering and are subsequently absorbed by the second layer, the direction of scattering is singled 
out from the hit pixels in the two detectors. In this paper we give the first detailed discussion of 
the sensitivity of such a generic design to the X-ray polarization. The efficiency and the modulation 
factor are calculated analytically from the geometry of the instruments and then compared with the 
performance as derived by means of Geant4 Monte Carlo simulations. 
Subject headings: X-ray — polarimetry — Compton scattering 



1. INTRODUCTION 

After the first pioneering experiments in the '70s 
(Novick et al. 1972; Weisskopf et al. 1976, 1978), only 
very few polarimetric measurements have been carried 
out in high energy astronomy (Coburn & Boggs 2003; 
Rutledge & Fox 2004; Willis et al. 2005; Kalemci et al. 
2007; Dean et al. 2008; Forot et al. 2008; Yonetoku et al. 
2011a). The main reason which prevented polarimetry 
to become a common tool also in this energy band was 
that even state-of-art instruments were able to measure 
the polarization of only the brightest X-ray sources. In 
the soft X-ray energy range, where grazing incidence 
optics were available, Bragg diffraction polarimcters al- 
lowed only for a modest quantum efficiency, whereas 
Thomson scattering polarimeters had a energy thresh- 
old mismatched with the energy band pass of the tele- 
scopes (Novick 1975). At higher energies, without the 
possibility to focus X-rays, Compton polarimeters re- 
quired a large collecting area and consequently the high 
background limited the sensitivity. As a result, no ded- 
icated polarimeters after the one on-board the OSO-8 
satellite were launched, with the exception of the small 
polarimeter Gamma-Ray Burst Polarimeter (GAP) re- 
cently launched on-board the Japanese satellite IKAROS 
to observe with a large field of view the prompt emission 
of Gamma Ray Bursts (Yonetoku et al. 2011b). 

Today polarimetry is a field of growing interest in high 
energy astrophysics (Bellazzini et al. 2010). At low en- 
ergy, gas detectors able to image the path of the pho- 
toelectron in low atomic number mixtures, e.g. Helium 
or Neon and dimethyl ether (DME), are a valuable al- 
ternative to Bragg diffraction and Thomson scattering 
polarimeters (Costa et al. 2001; Black et al. 2007; Bel- 
lazzini & Spandre 2010). The mission Gravity and Ex- 
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treme Magnetism SMEX (GEMS), a small explorer satel- 
lite planned to be launched in 2014, exploits these photo- 
electric polarimeters together with grazing incidence tele- 
scopes and promises to perform polarimetry of sources as 
faint as a few mCrab between 2 and 10 keV, with an enor- 
mous improvement in sensitivity with respect to OSO-8 
polarimeters (Black et al. 2010; Jahoda 2010). On the 
other hand, the recent development of multi-layer op- 
tics (Christensen et al. 1992; Pareschi & Cotroneo 2003) 
makes attractive to extend this energy range upward. 
The energy range of photoelectric polarimeters can be 
extended above 10 keV by using higher atomic number 
mixtures, like Argon and DME possibly at high pres- 
sure (Muleri et al. 2006; Soffitta et al. 2010), but above a 
few tens of keV Compton polarimeters (McConnell 2010) 
become more appealing because of the prevailing prob- 
ability of Compton scattering with respect to photoab- 
sorption. 

The design of a Compton polarimeter is of the great- 
est importance both to achieve the best sensitivity and to 
reduce the systematic effects. Indeed, there are a num- 
ber of very different proposals in the literature (see e.g. 
Krawczynski et al. 2011). In this paper we want to dis- 
cuss the performance of a particular design which, al- 
though not dedicated to polarimetry, can provide some 
polarimetric sensitivity contemporaneously with imaging 
and spectroscopy. As a matter of fact, many authors 
have asserted that such a design would endow next imag- 
ing/spectroscopy X-ray missions with the capability to 
detect also polarization from bright sources (Barret et al. 
2003; Gouiffes et al. 2008; Ferrando et al. 2010). The 
geometry of such an instrument will be described in Sec- 
tion 2, while the performance will be investigated with a 
full analytical treatment and with Geant4 simulations in 
Section 3 and in Section 4, respectively. 
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2. DESCRIPTION OF THE STACKED IMAGER 
POLARIMETER 

A common instrument layout dedicated to broad-band 
imaging in the focal plane of a X-ray telescope exploits 
two stacked pixelated detection planes designed to absorb 
photons at different energies (e.g. the imager camera on- 
board NHXM, Catalano et al. (2010), or the Wide Field 
Imager on-board the International X-ray Observatory, 
Stefanescu et al. (2010)). The first detector is usually 
made of Silicon and it is used to detect photons at lower 
energy (<1 to ~15 keV, depending on the thickness of the 
depiction layer), while harder photons, up to ^100 keV, 
are absorbed in the second detector, which is based on 
CdTe or CZT crystals. Although the largest part of the 
high energy photons passes through the first detection 
plane without interacting, a tiny fraction is absorbed or 
scattered by Silicon. For what concerns this paper, only 
the latter events are those interesting for measuring po- 
larization. If the energy deposit due to the scattering in 
the Silicon is above the detection threshold and, after the 
scattering, the photons are absorbed by the second de- 
tection plane, the direction of scattering can be singled 
out by joining the hit pixels in the two detectors (see 
Figure 1). 

Therefore, any instrument with two stacked imaging 
detectors has, besides imaging capabilities, also an in- 
trinsic sensitivity to polarization. To exploit it, one must 
distinguish among the events which release an energy 
deposit in both detectors, that we call "double events", 
those in which the photon underwent a scattering in the 
first detection plane and was absorbed in the second one. 
A first distinctive feature of such "good" events is that 
the signals from the two detection planes are basically 
contemporaneous and therefore the LEID and the HEID 
must be put into coincidence. The time window must be 
short enough to make negligible the probability of acci- 
dental coincidences due to, for example, background or 
pile-up and this may require a coincidence window of a 
few yus or less, and in any case a shaping time w /us. 
Another constraint for an event to be a good Compton 
double event is that the energy released in the first de- 
tector must be below a certain threshold corresponding 
to the maximum accepted scattering angle. The thresh- 
old is below a few keV for incident photons with energy 
less than 40 keV and reasonable values of the scattering 
angle. 

The implementation of these constraints poses a num- 
ber of requirements on the two detectors. Some of them, 
e.g. the fast read-out, arc quite severe for the Silicon- 
based detection plane. Nonetheless they seem to be in 
the range of the current technology based on Silicon Drift 
Detectors (Lechner et al. 2010). In the following we will 
not discuss in more detail the actual feasibility of the in- 
strument taken as an example in this paper, because our 
primary aim is to assess the intrinsic sensitivity to polar- 
ization of the stacked imager design. Therefore, we will 
naively assume a "toy model" including only the funda- 
mental components. The first pixelated detection plane, 
hereafter called Low Energy Imaging Detector (LEID), is 
made of Silicon, while the second stage is made of CdTe 
crystals and will be referred to as High Energy Imaging 
Detector (HEID). The characteristics assumed for the 
LEID and for the HEID are reported in Table 1 and 



they are in line with those actually required in modern 
mission proposals. We will also suppose that the time co- 
incidence between the two detectors is fast enough that 
the occurrence of spurious coincidences is negligible, as 
the background. Since the instrument discussed is de- 
signed to be used in the focal plane of a telescope, we 
will focus our attention in the energy range 20-100 keV, 
which is the interval where multilayer optics will be ap- 
plied in the next years. 



Table 1 

Characteristics of the stacked imager polarimeter assumed 
throughout this paper. 



Low Energy Imaging Detector (LEID) 


Material 


Silicon 


Thickness 


450 (im 


Area 


51.2x51.2 mm 2 


Pixels 


512x512 pixels, square pattern 


Pixel size 


100 /mix 100 /im 


High Energy Imaging Detector (HEID) 


Material 


CdTe 


Thickness 


2 mm 


Area 


51.2x51.2 mm 2 


Pixels 


256x256 pixels, square pattern 


Pixel size 


200 /mix 200 /tm 


Distance LEID/HEID 


2 cm 


Energy range 


20-100 keV 



In the following, it will be implicitly assumed that the 
Minimum Detectable Polarization, that is the upper limit 
to the polarization which can be statistically measured in 
a certain observation time and with a certain confidence, 
is fully representative of the sensitivity of the instrument. 
This approach is purely statistical and, consequently, it 
will neglect a very important requirement of any actual 
polarimeter, i.e. the reduction to a negligible level and, 
if this is not possible, the correction of any systematic 
effects which may result in a spurious polarized signal. 
Systematics may be severe for the polarimeter discussed 
here, because the probability of scattering in the first de- 
tection plane is tiny and therefore there are a number of 
small effects which may play a role. Without the inten- 
tion to be complete, systematic effects may arise from 
nonuniformitics in the pixel efficiency, energy threshold 
or geometry, or they may come out from pointing insta- 
bilities or anisotropic background. These issues must be 
extensively studied if dealing with the feasibility of the 
polarimeter design discussed in this paper, but, again, 
this is out of the scope of this work where we rather fo- 
cus on the "bulk" of the problem, that is the performance 
of the method in ideal conditions. 

3. SENSITIVITY TO POLARIZATION - ANALYTICAL 
TREATMENT 

Compton polarimetcrs detect the polarization of the 
incident radiation by measuring the photon scattering 
direction: a polarized signal causes a square-cosine mod- 
ulation to appear in the histogram of azimuthal direc- 
tions of scattering (hereafter modulation curve). Even 
if the incident radiation is completely unpolarized, the 
number of photons scattered per angular bin is Poisson- 
distributed around the mean value, and this always mim- 
ics a spurious modulation to some extent. The amount of 
this modulation at a certain confidence level translates in 
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Figure 1. Layout of the polarimeter studied in this paper. Polarization of photons is measured by selecting those events which are scattered 
by the first detection plane (named LEID, Low Energy Imaging Detector) and subsequently absorbed by the High Energy Imaging Detector 
(HEID). The scattering angle 8 and the azimuthal direction tp are singled out by joining the pixels hit in the LEID and in the HEID. The 



accepted events have to be scattered in the hollow cone defined by 6> m ; n and 6 n 



with < 



G [0, max ) and i 



< arctan ; 



a Minimum Detectable Polarization (MDP) and, by defi- 
nition, only a detection greater than MDP is statistically 
significant (Weisskopf et al. 2010). 

The MDP is calculated from the source and the back- 
ground fluxes in the selected energy range, F and B re- 
spectively, and for the 99% confidence level it is (Weis- 
skopf et al. 2010): 



MDP 



4.29 B + eF 



efiF 



ST 



(1) 



where T is the observation time, S the collecting area, 
e is the detector efficiency and (i the modulation fac- 
tor. This latter parameter is defined as the amplitude 
of the modulation curve when completely polarized and 
monochromatic photons are incident on the instrument: 



the sensitivity. To evaluate q, we need to calculate \x and 
e separately. 

3.1. Estimate of the modulation factor 

An estimate of the modulation factor can be computed 
from some very basic considerations, taking into account 
the geometry of the instrument and the angular depen- 
dence of the differential cross section of Compton scat- 
tering. The latter, for completely polarized photons inci- 
dent on a free electron at rest, is (Klein & Nishina 1929; 
Heitler 1954): 



an 



1 2 eP_ 

2 r °^ 



E 

e 7 



~E 



2 sin 2 6 cos 2 . 



(4) 



Mr. 



M n 



M = 



M max + M„ 



(2) 



where M max and M m j n are the maximum and the min- 
imum of the modulation curve, respectively. A higher 
value of n means that the instrument responds to polar- 
ized radiation with a larger modulation and the effect of 
statistical fluctuations is, in proportion, lower. 

In our toy model, we assumed that spurious double 
events and background are negligible and, therefore, the 
MDP is inversely proportional to the quality factor q = 



where 9 is the scattering angle, <f> is defined as the az- 
imuthal angle from the scattering direction to the polar- 

i 2 

ization vector of the incident photon and rn = -. 

" u 47reo mc-* 

is the classical electron radius. The ratio between the 
energy before and after the scattering E/E' is related to 
the scattering angle 9 by the Compton formula: 



E 
E 1 



l+e(l 



(5) 



where e is the energy of the incident photon in unit of 
electron mass, e = E/m e c 2 . Equation 4, by substituting 
Equation 5, becomes: 



MDP 



1 



4.29 



H^/e V FST 



(3) 



1 



1 + e (1 — cos ( 



In the following we will use the quality factor to optimize 
the sensitivity of the polarimeter: the larger q the higher 



2 sin 2 9 cos 2 . 



[l + e(l-cos0)f [l + e(l-cos0)]' 



(6) 
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In the following we will be interested only in the angular 
dependence of the differential cross section. Therefore 
we define the function Die, 9, 4) as: 



1 



ll + e(l-cos0) 
1 

[l + £(I-cos(9)] 3 
2 sin 2 9 cos 2 6 



(1 - cos 6*)]" 



(7) 



The first step to evaluate the modulation factor is to 
calculate the modulation curve, i.e. to make an his- 
togram of the azimuthal scattering directions for com- 
pletely polarized and monochromatic photons. We have 
to count how many photons are scattered in a certain 
azimuthal direction 4, regardless of the scattering angle 
9. The number of photons dN(e, 9, 4) scattered in the 4 
and 9 direction per unit of solid angle dfl at energy e is 
proportional to the differential cross section: 



dN(e, 9,4) = k D(e, 9, 4) sin 9A9&4 , 



(8) 



where n is a constant of proportionality and dfl = 
sin0d0d</>. Neglecting the probability of multiple scat- 
terings in the LEID, the direction measured by joining 
the hit pixels in the LEID and in the HEID will be that 
of scattering, and therefore the modulation curve is ob- 
tained by summing dN/d4 over the range of 9 values: 



M(e,(p,l 



mm i ^max y 



dN(e,6,t 



D(e,9,4)sin9d9 . (9) 



The limits of integration, i.e. 9 m i n and max , are at large 
fixed by the geometry of the system. Referring to Fig- 
ure 1 , and using the parameters in Table I , we have that 
in our case: 



I 

— arctan — sa 52 , 
2d 



(10) 



where I is the side of the HEID and d is the LEID-HEID 
distance, and m i n € [O,0 max ). We will discuss in Sec- 
tion 3.3 how more stringent limits on 9 affect the sensi- 
tivity to polarization. 

In the left hand side of Equation 9 we used the variable 
cp instead of 4 because the latter is the angle to the pho- 
ton polarization vector, the former is the angle to some 
axis of reference of the instrument. The two angles are 
related by 



(11) 



eventually derived by applying Equation 2: 



Mr, 



M„ 



M(e, <p = 0, min , 6> max ) - M(e, <p = tt/2, ( 



mm: w max j 



M(e, <p = 0, min , 9 max ) + M{e, <p = tt/2, 
" D( £) 0,7r/2) - D(s,6,0) 



D{e, 9, tt/2) + D(e, 0,0) 



sin0d0. 



(12) 



Equation 12 can be integrated employing standard 
techniques, but the resulting explicit expression is rather 
cumbersome and not easy to handle. Consequently, we 
study the behavior of ^(s,9 m - ln ,9 max ) numerically, by 
means of the Computer Algebra System Maxima 1 . In 
Figure 2 we report the modulation factor as a function 
of m in and m ax- As an example, we set the energy of 
incident photons at 30 keV but the qualitative behav- 
ior is fully representative, since the dependence of /i on 
the energy is rather weak in the range of our interest. 
The value of the modulation factor increases monotoni- 
cally with max and m ; n and, in principle, the maximum 
would reach nearly 100% for m i n , max ~ > 90°. However, 
this result is not relevant here because, if the scattering 
angle is close to 90°, the assumption of a single scattering 
in the LEID (which led to Equation 9) fails and so does 
our model. Moreover, any feasible layout of the polarime- 
ter discussed in this paper would require m i n and max 
to be constrained well below 90°. We limited the range 
of max and m i n between and 70° in Figure 2 to stress 
that our results are not applicable for m i n ,0 max — > 90°. 
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Figure 2. Behavior of the modulation factor as a function of 6 m j n 
and # ma x- The energy of the incident photons is 30 keV, but, within 
the energy range of our interest (20-100 keV), this does not affect 
the result significantly. We assumed that < 9 m i n < #max < 70° 
to stress that our results are not applicable for 8 mln , max — > 90°. 



where <po is the angle of polarization defined as the peak 
of the modulation curve. In the following we will assume 
ipo = and therefore the maximum of the modulation 
curve will be at ip = (or 4 — tt/2) and the minimum 
will be at ip — n/2 (or 4 — 0). The modulation factor is 



3.2. Estimate of the efficiency 
1 http://maxima.sourceforge.net/ 
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The efficiency e of detecting scattered photons has to 
take into account three main contributions: (i) the prob- 
ability of Compton scattering in the LEID, (ii) the pho- 
toabsorption efficiency in the HEID and (iii) that only 
a fraction of photons are actually scattered towards the 
HEID. We evaluate the first term, e^ att (-E), as the frac- 
tion of Compton scatterings among all of the interactions 
which occur in the LEID: 



[l-eiq>(-rf ot (E) P L x L )} 



(13) 

where fj,^ ot and /i,f ca tt are the total and the incoherent 
scattering attenuation coefficients for the LEID, p L is 
the density of the LEID material (Silicon, in our model) 
and x L its thickness. 

The second contribution to the efficiency takes into 
account the probability that the scattered photon is ab- 
sorbed in the HEID. We will assume in the following 
that the photoabsorption efficiency of the HEID is 100% 
in the energy range of our interest, i.e.: 



c abs 



i . 



(14) 



This assumption, although naive, will not affect our re- 
sults significantly. In effect, it is quite reasonable because 
the requirement for the HEID to have an high efficiency 
in the entire energy range is a heritage of its use as an 
imager. In the configuration assumed (see the Table 1) 
the x H = 2 mm thickness allows for the photoelectric ab- 
sorption of more that 80% of 100 keV photons incident 
on-axis. Also, the efficiency in the polarimetric mode is 
actually higher, because the scattered photons see the 
detector as having an effective depth x / cos 9. Claim- 
ing a 100% HEID efficiency allows both to simplify the 
following discussion and to derive the maximum possible 
sensitivity to polarization. 

The last contribution to the efficiency is due to the fact 
that, while photons are scattered over all the solid an- 
gle, only in a few cases their scattering direction crosses 
the HEID. Actually, we set a more stringent limit, i.e. 
we take only those photons scattered at angles between 
0min and # m ax, no matter the value of the azimuthal an- 
gle (f>. The fraction of accepted photons e c is calculated 
integrating dN(e, 9, 0) (see Equation 8): 



Cc(^; ^min? ^max) 



Je 



dN(e,6, 



dN(e, 9, <f) 



2ti 



D(e,9,<P)sm9d9 d</> 



/•27T pTT 



(15) 



D{e,6,cj)) sinfldf? 



uated from the three aforementioned contributions: 

= e scatt( £ i x l) X Cabs X e c(e, $min, #max) = 

= 4# [l-ex P (-^ ot(£ )/^)]x 
AHot l £ ) 

/ D(e,9,<p)sm9d6 d<p 
>o Je min 

D(e,6,<f>)smdd9 d(j) 



(16) 



where we used e = E/m e c 2 to express the dependence 
on the energy of the incident photon. 

As for the modulation factor, we likewise plot in Fig- 
ure 3 the behavior of e(e, 9 mia , # max ) as a function of 6* m ; n 
and 6* max for 30 keV photons. The qualitative result is 
rather simple: the efficiency is higher when the interval 
of accepted scattering angles is larger ((9 m ax — > 90° and 
$min — > 0). More remarkably, the absolute value of the 
efficiency is always lower that 1%. As a matter of fact, 
the value of e^, att is 1.6% for the assumed thickness of 
the LEID (450 /im) at 30 keV and the incomplete collec- 
tion of the scattered photons makes things worse. The 
low intrinsic efficiency is the most important deficiency 
in the design of a polarimcter based on stacked imaging 
detectors and it cannot be easily overcome. The only 
way is to increase the thickness of the depleted region of 
the LEID, but the value of e^ att is only 3.4% still in the 
case of a depleted region 1 mm thick, which is one of the 
largest values still technologically reasonable. Moreover 
the increase of the thickness would result in a larger back- 
ground and would spoil the performance of the LEID for 
the detection of faint sources. 
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Figure 3. Behavior of the efficiency as a function of £> m i n and 
#max for a LEID thickness of 450 /im. The energy of the incident 
photons is 30 keV, but, within the energy range of our interest 
(20-100 keV), this does not affect the result significantly. 



The efficiency of the polarimeter can be therefore eval- 
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3.3. Quality factor 

Equations 12 and 16 can be combined to obtain the 
quality factor q = [i\fe. Notably, we enclosed the depen- 
dence of q on the geometry of the detector in only three 
parameters, that are # m i n , # ma x and x L . This allows us 
to optimize the sensitivity of the polarimeter varying a 
very limited number of free quantities. 

The behavior of the quality factor at 30 keV with 
respect to 9 m - m and 6* max is reported in Figure 4 for 
x L = 450 /im. As expected, the sensitivity monotoni- 
cally increases with # max because both modulation factor 
and efficiency do. In the geometry assumed in Table 1, 
S max < 52° (cf. Equation 10) and therefore we freeze 
#max to 52° in order to maximize q. On the contrary, 
the dependence on # m i n is more complex. The interplay 
between the increasing modulation factor and the de- 
creasing efficiency originate a broad peak in the quality 
factor. This unexpected result is shown more clearly in 
Figure 4 where the dependence of q with respect to # m i n 
and energy is reported for 9 max = 52°. The peak is basi- 
cally independent on energy in the range on our interest, 
but depends on # max . A practical relation which linkes 
#max and the value of # m in which maximizes q is: 



Cf Meg] « 0.57 9 max [deg] 



(17) 



In our case, # max = 52° and therefore ~ 30°. We 

will use these values in the following as those which pro- 
vide the best sensitivity to polarization, but the choice 
to accept only events scattered with an angle larger than 
a certain threshold has also practical advantages. Events 
scattered at angles lower than 30° release in the LEID 
less than 0.23 keV at 30 keV according to the Compton 
formula (see Equation 5), and such a low energy deposit 
may be difficult to detect. 

4. MONTE CARLO SIMULATIONS 

The results discussed in the previous section are purely 
analytical. Moreover, they rely on some simplified phys- 
ical assumptions, e.g. in treating the Compton scatter- 
ing as from free electrons at rest. In the general case 
of the scattering inside a realistic material, however, the 
momentum distribution of the electrons and the binding 
effects in atoms will produce both a broadening of the 
Compton peak in the scattered photon spectrum and a 
suppression of the scattering at forward angles (Veigele 
& Tracy 1966; Bergstrom & Pratt 1997; Hubbell 1997). 
The incoherent scattering cross section thus becomes: 



den. 



= S F (x,Z) 



da 



KN 



dO 



(18) 



where Sf(x,Z) is the so-called scattering function 
and depends on the transferred momentum x w 
(E/hc) sin(0/2) and on the atomic number Z of the scat- 
tering atom. The efficiency for a given scattering polar 
angle 8 is therefore affected. The azimuthal angle <j>, on 
the contrary, can be assumed independent of the binding 
effects (Matt et al. 1996). Another effect to take into 
account is due to the "pixelization" of the detectors, and 
to the geometrical shape of the pixel itself. To make a 
meaningful comparison with a realistic case, we therefore 
developed a Monte Carlo simulator of our toy model for 
a stacked polarimeter (see Table 1), using the Geant4 
toolkit (Agostinelli et al. 2003), version 4.9.4. Accurate 



physics lists were employed for the various electromag- 
netic interactions, notably the Livermore low energy li- 
braries 2 that use the EPDL97 tabulated version of the 
scattering function (Hubbell et al. 1975). 

Geant4 simulations allow us to treat the physics of the 
scattering more accurately. Notwithstanding, we inten- 
tionally maintain also in Monte Carlo treatment some 
simplified assumptions in the design of the instrument. 
In particular, we do not apply any energy threshold to 
the event detection in the LEID and consider LEID and 
HEID as having an infinite energy resolution. We are 
aware that these parameters play an important role in 
the determination of a good estimate of the detection ef- 
ficiency, especially at low energy. In particular, any real- 
istic assumption on the energy threshold and on the spec- 
tral resolution would decrease the efficiency and hence 
the sensitivity to polarization. Nonetheless, this choice 
is in line with our primary goal, that is to argue on the 
intrinsic sensitivity to polarization of the stacked imager 
layout rather than to propose a feasible design and to 
discuss its sensitivity. 

4.1. Data Analysis 

For each event (i.e. for each primary photon generated 
in the simulation run), the output of the Geant4 Monte 
Carlo simulator is a list of the pixels having a non-zero 
energy deposit, with their location and deposited energy. 
Only double events which release an amount of energy 
simultaneously in both the LEID and the HEID are rel- 
evant to study polarization and are recorded. However, 
among them we must select the good events, which are 
those for which it is possible to reconstruct the direction 
of scattering, and therefore are useful to measure the 
polarization of the incident photon beam. Examples of 
double but not good events are those in which one of the 
detectors collects the fluorescence radiation emitted after 
the photoabsorption in the other detector or events which 
undergo multiple scatterings in different pixels, especially 
in the LEID. Another process which also produces double 
events is the photoabsorption in the few microns at the 
bottom surface of the LEID: in this case the photoelec- 
tron may escape from the LEID and it could be absorbed 
in the HEID. These events, albeit rare, must be removed 
because the direction of emission of the photoelectron 
is sensitive to polarization but the effect is opposite to 
that of the Compton scattering, i.e. the probability of 
emission is maximum along the polarization. In our sim- 
ulations we used a 30 /im thick beryllium plate between 
the LEID and the HEID to stop the large part of the 
photoelectrons but not X-rays (the transparency of the 
plate at 20 keV is 99.9% on-axis). However the configu- 
ration of such a shield could be conveniently adapted to 
specific needs in actual instruments, for example it could 
be a thin thermal shield if the LEID and the HEID must 
work at different temperatures. 

We defined some criteria to filter those events in which 
the photon underwent only two interactions, the first as 
a scattering in the LEID and the second in the HEID, 
restricting ourselves only to procedures which could be 
applied also to data collected by real instruments. The 
first filter is a spatial filter, and it is based on the fact 

2 https: / /twiki . cern.ch /twiki /bin / view / Geant4 / 
LowcMigratcdLivcrmorc 
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that good events must undergo the first LEID scatter 
in the same pixels in which the source is imaged. An 
accurate model of the instrument should consider that 
the image of the source is distributed on the focal plane 
according to the point spread function of the telescope. 
Instead, in our toy model we just choose a circular beam 
distributed uniformly in the four central pixels of the 
detector. Therefore we select only those events which 
release at least an energy deposit in one of these pixels 
and/or in their adjacent ones. Secondly, we imposed a 
criterion on the number of hit pixels (hit pixels filter). 
We select only those events for which there is only one 
energy deposit in the LEID, which must be in a location 
compatible with the spatial filter, and only one hit pixel 
in the HEID. The second condition must be applied with 
some attention because we experienced that the charge 
produced by a photoabsorption event may be spread over 
a cluster of several non-contiguous pixels, mainly because 
fluorescence photons may be absorbed at some distance 
from the photoabsorption point. We treat such a cluster 
as a unique pixel with a charge equal to the total energy 
deposit and a position weighted with the energy collected 
in each pixel of the cluster. Operatively, we consider 
that all pixels less distant than 10 pixels belong to the 
same cluster. Although such a large summation radius 
may imply some drawbacks on a real device, especially 
to remove the particle background, large clusters must 
be correctly included in the analysis to avoid systemat- 
ics on the modulation curve due to the square pattern 
of the pixels (see Figure 8). A third filter is applied on 
the energy collected on the HEID, which must be higher 
than 5 keV (HEID energy filter). Such a filter has not a 
large impact on the efficiency (—3% at 30 keV) or on the 
modulation factor (variation below the statistical error). 
Notwithstanding, we imposed a energy threshold for the 
HEID events in order to remove those double events in 
which the photon is absorbed in the LEID and the fluo- 
rescence of Silicon at -1.7 keV in the HEID. The 5 keV 
value is reasonable for current CdTe detectors. The last 
filter we defined, called 9 -filter, is a filter on the scat- 
tering angle, measured by the position of the pixels hit 
in the two detectors. We used the same limits derived 
in Section 3.3, that is we restricted the scattering angle 
between 30 and 52°. 

The spectrum obtained for the LEID, the HEID and by 
summing event by event the energy deposits in the two 
detection planes is shown in Figure 5 for 30 keV incident 
photons. The spectrum including all events is reported 
as a gray histogram, while that obtained after applying 
the filters discussed above is in black. A lot of prominent 
lines are visible and correspond to the escape peaks or 
to the absorption of fluorescence lines from Silicon (K- 
lines at —1.7 keV), Cadmium (K-lines at —23-26 keV, L 
lines at —3 keV) or Tellurium (K-lines at —27-31 keV, 
only visible for photons above —32 keV, and L lines at 
—4 keV). Good events are basically characterized by a 
small energy deposit El in the LEID (a few keV) and by 
an energy deposit in the HEID equal to E — El, where 
E is the energy of the incident photons. These events 
fill in the total spectrum the energy bin around 30 keV 
and represent by far the main components after apply- 
ing the filters (black histogram in Figure 5). Another 
class of good events are those which deposit in the HEID 
an energy equal to E — El but the fluorescence photon, 



which is emitted with high probability (>80% for K-shell 
photoabsorption and high-Z material like Cadmium and 
Tellurium), escapes from the detector. In this case the 
energy detected in the HEID is E — E L — Ef, where Ef 
is the energy of the fluorescence photon, and therefore 
these events make up an escape peak in the total spec- 
trum at energy E — Ef. In Figure 5 two of such escape 
peaks are visible at about 7 keV, which correspond to the 
escape of Cd K Q i and Cd K a2 fluorescence photons with 
energy 23.2 keV and 23.0 keV, respectively. It is worth 
mentioning that the application of the ^-filter has the 
positive effect to make our results less dependent on the 
energy threshold of the LEID. As a matter of fact, the 
removal of events scattered at less of 30° cut out the large 
part of the events which deposit in the LEID only a few 
hundreds of eV (see the inset in top panel of Figure 5). 
For the sake of completeness, we also show in Figure 6 
the spectrum for 90 keV incident photons. The main 
features remain the same as those discussed in the case 
of 30 keV radiation, although in this case it is visible a 
low energy bump in the HEID and in the total spectrum 
which corresponds to photons which, after the scattering 
in the LEID, scatter a second time in the HEID. Such 
events are still good to measure polarization, although 
their statistical weight is negligible with respect to the 
events which are absorbed after the first scattering. 

Events which pass all the filters described above are 
used to build a modulation curve which is fitted with the 
function 

M(p) = A + B ■ cos(<p- <p ) ■ (19) 

The modulation factor /i is derived as usual by (cfr. 
Equation 2) 

_ A4max - Mmin _ B 
^ M max 

4.2. Results 

We performed simulations at 20, 30, 40, 50, 60, 70, 80, 
90 keV and 100 keV, generating 50 • 10 6 photons for each 
energy. Taking as an example the simulation at 30 keV, 
we detected 1.2 • 10 6 double events, that are for the large 
part events in which the photon is absorbed in the HEID 
and some fluorescence in the LEID. Among them, 162 • 
10 3 events were selected by the spatial filter, 156 • 10 3 
passed both the spatial and the hit pixels filter, 151 • 10 3 
passed also the HEID energy filter and eventually 93 • 10 3 
passed also the 6>-filter. We repeated the simulations for 
three different values of the angle of polarization, that are 
0°, 25° and 90°, to check the consistency of our results. 
In this respect, the first and the last values represent the 
two extreme conditions, while 25° is just an intermediate 
value not in resonance with any expected on a square 
pattern. 

Examples of the typical modulation curves obtained 
from Monte Carlo are shown in Figure 7 in the case of 
30 keV photons and a polarization angle of 0° or 25°. 
The fit with the square-cosine function is very good and 
the reduced \ 2 is 1.067 and 1.042 respectively for 97 de- 
grees of freedom. We report in the same Figure as the 
gray histogram also the modulation as it would appear 
without applying the ^-filter. In this case a strong sys- 
tematic effects appears in the modulation curve which is 
simply due to the fact that the HEID detection plane is 
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Figure 5. Spectrum measured by the LEID (top) and by the HEID (middle) for 30 keV incident photons. The total spectrum, obtained 
by summing the energy collected by the two instruments event by event, is in the bottom panel. The inset in the top panel shows a zoom 
at low energy of the LEID spectrum. In black and gray are reported the spectrum with and without the filters discussed in the text to 
distinguish real scattering events, including the 0-filter. The diagonally dashed region in the middle panel shows the energy threshold 
imposed for the HEID with the energy filter. 



square. As a matter of fact, in each bin of the modula- 
tion curve there is the number of events scattered in a 
certain azimuthal angular bin regardless the scattering 
angle. Without the 0-filter, the scattering angle is only 
constrained by the geometry of the two detection planes 
because the photon must cross the HEID to be detected. 
However, the detection planes are square, and this im- 
plies that the interval of accepted scattering angles is 
larger in the diagonal directions thus causing a bump in 
the number of collected events. 

Although we neglected all of the main causes which 
may induce a systematic modulation for a real instru- 
ment, it is interesting to look at the behavior of the mod- 
ulation curve for completely unpolarized photons. This 
is reported in Figure 8 for 90 keV photons, in which as 
above the black and gray histograms are those obtained 
with or without applying the ^-filter, respectively. As ex- 
pected, the detected modulation is consistent with zero 
with a 33% confidence level. However, it is also interest- 
ing to see what would be the result of our analysis in case 
of slightly different filter parameters. When we described 
the hit pixels filter in the Section 4.1, we mentioned that 
a large summation radius, that was 10 pixels, is required 
to avoid systematic effects on the modulation curve. In 
the lower panel of Figure 8 we show the modulation curve 



obtained by exactly the same data as in the upper panel, 
but with the only difference that the summation radius is 
1.6 pixels instead of 10. In this case only pixels which are 
contiguous on the side or on the diagonal are considered 
as a unique cluster and therefore only events involving 
a single cluster of contiguous pixels pass the hit pixels 
filter. The result is that a strong systematics due to the 
square pattern of the pixels emerges with a peak to peak 
variation of ~10%. Although formally the square-cosine 
modulation is still consistent with unpolarized radiation 
at a 15% confidence level, this is only due to the high 
symmetry of the four peaks. In actual devices also small 
nonuniformities would cause a much higher modulation 
and in the most extreme situation the spurious polariza- 
tion, obtained by dividing the peak to peak variation by 
the modulation factor, would be <~10%//i w 30%. As a 
matter of fact, spurious modulated signals can arise from 
minor effects even in apparent symmetrical geometries. 

In Figure 9, the modulation factor, the efficiency and 
the quality factor obtained from Monte Carlo are com- 
pared to the values derived by the analytical treatment 
discussed in Section 3. The efficiency is calculated as 
the ratio between the number of events which passed 
all filters and that of incident photons. The agreement 
is in general rather good, although there are some de- 
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Figure 6. The same as Figure 5 but for 90 keV photons. 

viations with respect to the expected dependence. In 
particular, the modulation factor is larger and the effi- 
ciency is smaller than expected, with a difference which 
decreases with energy. This disagreement is easily ex- 
plainable in view of the fact that in Monte Carlo simu- 
lations we treated Compton scattering more accurately 
by including the scattering function. As a matter of fact, 
this has the effect to suppress the forward scattering, es- 
pecially at low scattering angles and at low energy. Only 
forward scatterings are collected in the stacked imager 
layout and therefore an efficiency reduction is a straight- 
forward consequence of introducing the scattering func- 
tion. This can as well explain the increase of the mod- 
ulation factor because the suppression is more effective 
at low scattering angles, where the intrinsic modulation 
with polarization is lower. This interpretation is con- 
firmed by studying the scattering angle distribution of 
the events which passed the filters. At low energy (see 
Figure 10, left panel) there is a significant difference be- 
tween the distribution obtained from Monte Carlo and 
that expected from the Klcin-Nishina formula, especially 
at low scattering angles. This explains why the estimates 
of the modulation factor and of the efficiency obtained 
analytically and with the Monte Carlo are different. In- 
stead the results are more in agreement at higher energy, 
where the two distributions differ less (see Figure 10, 
right panel). 

Monte Carlo simulations allow us to confirm another 



result obtained in the analytical treatment presented in 
Section 3, that is the maximum of the sensitivity for 
#min ~ 30° (see Equation 17). In Figure 11 we show 
the quality factor as a function of 9 min for 6* max = 52° 
and 30 keV photons. The dependence obtained from the 
simulations is closely related with the expected one, with 
a small offset which can be again ascribed to the inclusion 
of the scattering function in the simulations. 

5. CONCLUSIONS 

The primary aim of this work was to assess the in- 
trinsic sensitivity to polarization of the stacked imager 
design more than to discuss the feasibility and the per- 
formance of a particular implementation. In this respect, 
the results of our analytical treatment and of Monte 
Carlo Geant4 simulations are in full agreement. The 
first noteworthy result is the discovery of a peak in the 
quality factor, a quantity assumed to be representative 
of the polarization sensitivity, for a particular range of 
accepted scattering angles. It is worth mentioning that 
this non-obvious result firstly came out by our analytical 
treatment. This suggests that an optimization of Comp- 
ton scattering polarimeters can (and should) be done 
also with elementary considerations based on the Klein- 
Nishina cross section. Unfortunately, our main result is 
that the stacked imager design has a very low sensitiv- 
ity to polarization despite our optimistic assumptions. 
We are far from an optimized layout and some improve- 
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Figure 7. Modulation curve obtained from Monte Carlo simula- 
tions for an angle of polarization 0° (upper panel) or 25° (lower 
panel). The black and gray histograms refer to the modulation 
curve obtained if the 0-filter is applied or not, respectively. The 
black and gray histograms are not to scale. 

ment is still possible with a more refined data analysis. 
For example, in the stacked imager design the appropri- 
ate modulation factor could be associated to each event 
depending on the scattering angle instead of using an 
average value. Also, it would be possible to add more 
detectors encircling the imagers in order to detect pho- 
tons scattered on a large fraction of the solid angle and 
not only in the forward direction. However, all of these 
tricks may improve the sensitivity of a factor of a few at 
maximum while the low sensitivity of the stacked imager 
design is largely limited by the very low probability for 
scattering in the first detection plane. At this regard, we 
do not think that a major improvement is possible since 
our estimates are based on the fundamental physics of 
the Compton scattering. An increase in the scattering 
efficiency, even though not outstanding, would be possi- 
ble only by increasing the Silicon thickness. However this 
would unavoidably worsen the performance of the detec- 
tor as an imager and this may be unacceptable, since the 
main use of such a stacked imaging detector would be 
in any case to image photons on a large energy range. 
In addition, the dependence of the quality factor on the 
efficiency is rather weak and, roughly speaking, doubling 
the efficiency would result in just a 40% increase in qual- 
ity factor. 

Obviously, a certain polarization sensitivity, albeit low, 
is present in any case and the common argument is that 
it can be exploited "for free". However we think that 



Figure 8. Modulation curve obtained for unpolarized radiation 
at 90 keV. Upper panel: Modulation curve obtained from Monte 
Carlo simulations if the 0-filtcr is applied or not (black and gray his- 
togram respectively). The two histograms are not to scale. Lower 
panel: The same as the upper panel, but in the case the summa- 
tion radius is 1.6 instead of 10, that is only contiguous pixels are 
considered as an unique cluster. 

this argument should be discussed with care because the 
choice to use stacked imagers to detect scattered pho- 
tons poses some requirements which are usually not set 
if the instrument is used just as an imager. For example, 
fast coincidence between the two detection planes is an 
ad-hoc requirement. Specific calibration campaigns with 
polarized and unpolarized radiation would be required 
in order to obtain reliable results and to single out e.g. 
nonuniformities in the pixel response. Dedicated efforts 
would also be needed for data analysis. In the most opti- 
mistic assumption of negligible background, good events 
are still overwhelmed by other double events by an order 
of magnitude, and the application of some of the filters 
that we used, e.g. the hit pixel filter, may be difficult for 
real instruments. Another issue which we intentionally 
disregarded but requires significant efforts when dealing 
with real instruments is the control of the systcmatics 
which may be severe for the stacked imager layout. 

The low sensitivity we estimated is also not encourag- 
ing when it is compared with the performance of dedi- 
cated instruments. For example, the focal plane Comp- 
ton polarimeter discussed by Soffitta et al. (2010) (but 
see also Fabiani et al. 2012 in preparation) has a quality 
factor which is higher than 0.40 at 30 keV and almost 
constant with energy, i.e. at least a factor ~30 higher 
than that of a stacked imager polarimeter. Roughly 
speaking, this means that the same MDP can be reached 
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Figure 9. Comparison of the modulation factor (top), efficiency (middle) and quality factor (bottom) as derived by the analytical 
treatment (dashed line) and by the Monte Carlo simulations (points). The different symbols refer to different values of the incident photons 
angle of polarization, 0°, 25° or 90°. Error bars are shown only for the first value of the polarization angle for graphical reasons, but they 
are representative for the other values too. The small disagreement between the analytic and Monte Carlo results, which decreases with 
energy, is due to the fact that in Monte Carlo simulations we treated more accurately the physics of the Compton scattering. 



with an observation ~ 30 2 = 900 times shorter. This ob- 
viously discourages the efforts to develop and calibrate 
the polarization capabilities of forthcoming stacked im- 
agers and rather suggests to use dedicated instruments. 
A possible policy which could be adopted in case of mis- 
sions with a large telescope is to mount the stacked im- 
ager to perform imaging and a dedicated polarimeter on 
a movable platform and to place alternatively the two in- 
struments in the focus of the telescope. For example, this 
was the strategy adopted for XEUS/IXO. If the science 



objectives of the mission require simultaneous spectral- 
imaging and polarimetric measurements, a solution could 
be to use two different telescopes for the two instruments. 
After all, even used with a telescope with a collecting 
area 10 times smaller than that of the stacked imager, 
the dedicated polarimeter would still require an observa- 
tion time ~90 times shorter to reach the same MDP. 

As a final note, it is worth mentioning that our re- 
sults are not in agreement with those reported by other 
authors. In particular, Gouiffes ct al. (2008) reported 
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Figure 10. Comparison of the scattering angle distribution for the events which passed all the filters in Monte Carlo simulations at two 
energies, 30 keV (left) and 90 keV (right). The distributions are compared with that expected on the basis of the Klcin-Nishina formula 
used in the analytical treatment (dashed line). The difference between the two distributions originate the small disagreement visible in 
Figure 9. The diagonally dashed areas are the regions which are cut out by the 0-filter. 
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